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SUBJECT: AUTOMATIC ASSEMBLY OF PROGBAMS 

To: Air Defense and Applications Groups 

From: John W* Carr III 

Date: April 23, 1952 

Abstract: A study* of methods of programming with the present and proposed 
conversion programs shows that most programs, with present 
methods, can he written in such a fashion that they can "be 
assembled automatically, with no calculation of storage posi- 
tions on the part of the programmer* 

General Discussion 

Eie philosophy of this memorandum is diametrically opposed to 
that of the programmer who uses octal notation, likes itV and Is con- 
vinced that this is the most satisfactory way to write a program* 
Nevertheless, under certain conditions, the method proposed here may 
prove to "be a much more speedy, more easily corrected way to give com- 
plete machine instructions* To the programmer who is already using 
decimal notation and preset parameters, the proposed methods are merely 
a logical extension of certain devices he has already been using* 

This method of assembly, used with the Whirlwind conversion 
program as of this date (or most probably with any future conversion 
program) will parallel the general method of coding first proposed by 
von Neumann and Goldstine (l)*, but which up to now has been impossible 
to use* 

It is very similar to the "free-" or ° floating-address" 
device first proposed by Wilkes at the informal programming conference 
at the Philadelphia A.I.E»E.-I.B*E.-A.C.M. meeting* (2). With a few 
changes in the notation of the present control combinations of the 
conversion program, and the addition of a few extra control combinations, 
all possible programs should be handled satisfactorily* With the 
present conversion program, any program not using multi-register number 
storage can be assembled completely automatically* Programs using 
multi-register number storage will still require a small amount of 
program layout on the part of the programmer, at least until one or - 
two more control combinations are added to the conversion program* 



Humbers in parentheses refer to the Bibliography at the end of the paper* 



Memorandum M-1445 Page 2 



The proposed method of assembly is most useful for programs 
that use subroutines from a given subroutine library. However, with 
a conversion program that provides sufficient preset parameters, it 
can "be very useful in programming anv program, whether or not it uses 
subroutines* 



Method of Assembly 

In the original octal (and later decimal) programming methods, 
each storage register in the machine was known by a preassigned number, 
which was the direct equivalent of the "binary number appearing in the 
machine program counter. Under the new proposed method, storage loca- 
tions are to be known in the outside programmer's language "by "names, " 
which will have no direct connection with the corresponding address in 
the machine, except through the translation or conversion program 
itself. 

Thus, for example, at present the first address of a print- 
subroutine is known on the outside as i5. This actually implies that 
the address of the subroutine is stored in the x5 parameter, and upon 
read-in any address printed on tape as x5 will be converted to yield 
the machine address stored in preset parameter x5. 

However, up until now, programs converted from paper tape 
have not been assembled automatically, even though the preset parameter 
technique has been u$ed„ to some extent for addresses* What has been 
needed is the use of a * two-pass 11 system of paper tape read-in. Such 
a system was originally planned for the next conversion program, using 
magnetic tape, but lack of a suitable line-by-line photoelectric reader 
made postponement necessary. What was not immediately understood, 
however, was that a two-pass system was possible with the present 
paper-tape direct conversion program. Now that an experimental program 
has been written and operated using the proposed automatic assembly 
method, a "two-pass" paper tape method may become useful at least until 
a M two-pass" or "multi-pass" magnetic tape scheme is perfected. 

The basis of the method is as follows: main program, sub- 
routines, constants, and temporary storage registers are stored indis- 
criminately (with the exceptions noted below) one after another in 
storage. The addresses of any given register (such as first address 
of a subroutine, or any register referred to by another instruction) 
may be given a "name" corresponding to one or the preset parameters. 
It it* olum rdlbi'ieu oo, oy that preset parameter, in the normal fashion. 
This address is thereafter referred to by this "name." Instructions, 
constants, subroutines, etc., are stored consecutively in the machine, 
so that there are no gaps in the used registers in the machine. 

However, since some programs of necessity refer to addresses 
which have not yet been assigned to the corresponding preset parameters, 
two "passes" along the tape must be made. The first read-in automatically 

determine?? the petition of each register in storage. . When any register 



Memorandum M-1445 Page 3 



has a "name" given "by the corresponding parameter, this value is auto- 
matically assigned. On the first read- in, some of the addresses corres- 
ponding to their "named" registers will he incorrect, because they refer 
to preset parameters that have not yet "been assigned addresses* On the 
second read-in, however, after the correct addresses have "been assigned 
to each preset parameter in use, the correct addresses will he inserted 
into the registers calling for addition of the required preset parameters* 

s JJ¥flf- length Assembly 

Because assembly of single-length programs does not involve 
calculation of the separation constant K and use of the present special 
multi-length input programs, we will consider such an assembly first. 
The following rules will apply: 

1. Temporary registers will he assigned last, so that 
the question of how many temporary registers are 
needed will not enter (except in the question of 
whether storage has heen exceeded)* 

2. All other storage may he placed in any sequence* 

3. Any address which is referred to hy any instruction 
may be assigned a preset-parameter "name*" 

4* All other addresses are considered to he "nameless," 
although they may he referred to if needed. 

5* Assignment of registers is made automatically hy 
control combinations as follows. The "current 
storage address" is placed in the relative address 
parameter storage (r storage) by the control com- 
bination "0/*" The current address is then trans- 
ferred to the particular preset parameter storage 
by another control combination. For example, if a 
particular address is to be "named" by parameter vl, 
then the current address would be stored by means 
of the combination 

0/vl/pOr. 

At the end of this sequence on the tape the current 
address weald be stored in the vl parameter, since 
vl/pOr transfers the contents of the relative 
address storage into vl* 

An example of a single length program coded by this automatic 
assembly method, and using the "floating-address" scheme, is the tape 
T-1070-1. 
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ifkm 10,pl,p0,p7,p0,p31,pl ,132 



flO 



ifkm 10/132 



0/vxl/pOr. 

caax2 
*dax3 

0/vx3/p0r. 

C/ 3 

eaO 

spax4 

aoax3 

suax5 

cpax3 

rsO 

0/vx4/p0r. 

vl/ tal9r. • • • 

48/p49 
0/vx6/p0r. 

po 

Pi 
p2 
P3 
P* 
p5 
p6 

P7 

0/vx5/p0r. 

ca7az6 

0/vx2/p0r« 

pax6 

0/v/pOr. 

faxl 



l/ts25r»«» 



J.fkm: 5-5-6 form for sp 844 -\ These 

(control to direct 1 control 
conversion) / comMnatiom 

10,: Decimal addresses, K print 

erase storage the 

pl,p0,p7,p0,p31,pl,: T 1070-1 tape 

132: Begin storing at J title 
register 32 

flO: Return control to 5-5-6 input 
program at reg. 10 

ifkm: 5-5-6 form for sp 844 \ Second 

(control to direct ) input title 
conversion program) / for 
10/: Decimal addresses, do ( second pass 

not erase* \ with 

132: Begin storing at reg. 32] tape 

Put current address (32) in 
preset parameter vxl. 

Put current address (34) in 
preset parameter rac3. 



Put current address (40) in 
preset parameter vx4» 



Put current address (89) in 
preset parameter vx6» 



(subroutine fo 
\ printing 
(integers. 



Put current address (97) in 

preset parameter vz5* 
Put current address (98) in 

preset parameter vx2. 
Put current address (99) in 

preset parameter v» 
He turn control to the address stored in 

preset parameter xl. 
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This program is supposed, as a test, to print out 0,1,2, ...7, 
in that order* It was coded successively from "beginning to end, without 
the necessity of ass igning' any numerical addresses to storage. After 
the first "pass" of the tape through the photoelectric reader, the 
following will De stored in the machine: 

32 ca 

33 td 34 

34 ca 

35 sp 

36 ao 34 

37 su 

38 cp 34 

39 rs 

40 - 88 (subroutine) , 

89 pO 

90 pi 

91 p2 

92 p3 

93 p4 

94 p5 . _. . 

95 p6 

96 p7 

97 ca 96 

98 p89 

The contents of the preset parameters storage will "be as follows: 

vxl/p32 
Tx2/p98 
vx3/p34 
vx4/p40 
vx5/p97 
vx6/p89 
v/p99 

Thus at the end of the first "pass," all preset parameters will have 
the correct value. 
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For the second pass, the tape must he read in starting 
immediately after the flO, since at present the conversion program 
uses preset parameter storage for printing the tape number, which 
would write over whatever had been stored on the first pass* At the 
end of the second read-in, storage would be as follows: 

32 ca 98 

33 td 34 

34 ca 

35 sp 40 

36 ao 34 

37 su 97 

38 cp 34 

39 rs 

40 - 88 (subroutine) * 

89 pO 

90 pi 

91 p2 

92 p3 

93 p4 

94 p5 

95 p6 

96 p7 

97 ca 96 

98 p89 

99 • ••(on) temporary storage d,lt,*«* 

Corrections 

This formulation is very easy to correct, in case of an error, 
both during programming and on the tape* For example, if during 
programming it is found that several registers have been omitted, they 
can be inserted directly in the proper place in sequence, without 
having to renumber all the successive registers and their addresses* 
For example, in the program above » if the programmer wanted to stop 
printing at the first negative number to he called in, immediately 
after the caax2 he could insert cpax7 . and before the rsO . 0/vx7/-pQr » 
Aside from this, no renumbering would be necessary* 
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00^16 Precision and Floating Point Numbers 

As of the time of writing, the situation as regards read- in 
and conversion of Double Precision and Floating Point numbers is not 
helpful in automatizing storage in those cases where those types of 
numbers and Interpretive subroutines are used* At present, two con- 
version programs are necessary, one for single-length words, and a 
second for multi-length number storage* The second input program 
at the present time also unfortunately uses preset parameter storage 
vl,*».vl5 as temporary storage, and thus would change the contents of 
those registers arbitrarily on a second "pass." Finally, because the 
final portion of storage is used for the various conversion programs, 
and temporary registers must now be double- or multi-length, other 
difficulties are inherent* 

For this reason, we shall present an automatic assembly 
plan making use of the new magnetic tape conversion program, now 
being written. This program has the important properties that preset 
parameter storage is not used to store characters to be typed, or as 
a temporary storage* It will also have more preset^ parameters avail- 
able for use* 

The new conversion program will contain a "multi-length 
current address" register, which is tentatively scheduled to be the 
vxl parameter* Thus, to give a multi-register number a "name," the 
following control combination could be used to place the "multi- 
length current address into the vx5 preset parameter, for example: 

vx5/pazl* 

The indexing of the "multi-length current address" is thus done 
automatically* 

The problem of temporary storage, which was solved, or more 
truthfully avoided in the single-length case by placing the temporary 
registers last, is a stumbling block in the multi-precision case* 
The separation constant can be calculated only when the number of 
multi-length registers depends on the number of temporary registers. 
However, temporary registers, at present, may be either single- or 
multi-length, and there is no control combination or automatic section 
of the conversion program that can determine the maximum number of 
temporary registers* Until such a change is made in the input program, 
the following scheme is proposed: 

1* Multi-length constants, of every sort, must be 
stored in one block, last in the automatic 
assembly section of storage* 

2. Temporary registers must be placed last in 
the group of multi-length registers. 

3. Any address which is referred to by any instruc- 
tion may be assigned a preset parameter "name." 
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Single-length words will "be assembled first, with the 
exception of temporary storage, which will "be considered to consist 
of all multi-length storage* Following the last single-length word, 
a preset parameter "name" should be given to the first multi-length 
address for some preset parameter (such as vx4, for example) by the 
control combination 

0/vx4/p0r vxl/pax4. 

This then sets the beginning of double-length storage to the proper 
place* Any number referred to later on in the group of multi-length 
storage can be "named by 

vx5/paxl»..*,etc* . 

Following the last multi-length number, the separation parameter is 
calculated automatically by 

vx2/paxlsx4 t 

where vx4 is the parameter that "names" the first multi-length reg- 
ister* Thus vx2 contains the difference between the contents of the 
vxl and vx4 parameters* 

However, if temporary storage is to be included, the vx2 
parameter should be calculated automatically by 

vx2/pmax/sx4, 

where m is the maximum number of temporary registers used* 

We can ^hus give a short example of automatic assembly of 
a multi-length program as follows, assuming now that there will be 
no use of preset parameter storage for other purposes* Under the 
present multi-length separation scheme, m would have to equal "maxi- 
mum number of temporary storage registers used plus total number of 
floating subroutine (vxl) storage used." 
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10, Title , 132 

0/-oS/pOr 
ca 

td ax4 

i 
i 

I 

0/vx4/p0r. 

ca 
ts ax5 



Single-length words 



etc* 

0/vx6/p0r. Trxl/pax6i 



Subroutines with double 
length constants 
gaxL c $ 

po 

gazlax2 

nO 

▼lOO/paxl . TCl/plalOO » 



rc7/paxl. 

+.1/+1 

i 

i 

i 
TO8/paxl. 

+.5/+1 

I 
i 

v/paxl. 

pmaxlsx? • 
fax3 



Double-length numbers 



Temporary storage 



J 
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Cautions In Use 

At present, the preset parameter scheme is not completely 
satisfactory for such a scheme, since there are not enough permanent 
(vx) parameters available for a complete assembly* Use can he made 
of the temporary (▼) parameters, hut it should he noted that many of 
these are changed during read-in of subroutines. 

The programmer should also guard against other changing of 
the preset parameters, -such as on read- in during the printing of the 
•tape title, and on read-in of multi-length numbers with the present 
(special) multi-length conversion program. 

A good rule to follow until some specific final policy is 
made concerning preset parameters, is that only vx parameters may he 
considered unchanged during the course of a program. However, other 
parameters may he used if the subroutine situation is investigated. 

It also may he possible, in many cases, hy Judicious arrange- 
ment of the program's components, to make the assembly process auto- 
matic on only one pass. This requires that the maximum number of 
temporary registers he known, and that all preset parameters referred 
to in instructions have their addresses assigned previously. 



Hew Control Combinations 

Several new control combinations are ohviously called for 
if this scheme is to he completely useful. One is a single combination 
to replace the group 0/vx4/p0r., for example. This might he w kx4, n 
meaning "keep the current address in the x4 parameter . n 

Another might he incorporated in the f control combination. 
This would check to see if the current address were still less than 
or equal to a "final available register address." If it were not, the 
conversion program in some fashion could say, "not enough available 
storage." 

A useful change would be to set up a third set of assembly 
parameters that would replace v, vx, vxl, vx2, vx3, etc., just as t 
is not now included among them. The "floating address parameters" 
would not then be cluttered up with parameters basically used for 
other purposes. 

Finally, the question of temporary storage remains. One 
way to solve it would be for the conversion program to store the 
"name" of the largest temporary register in some "temporary storage 
length parameter" and to check each' successive temporary register 
encountered to see if its name is larger. This value could then be 
inserted automatically in the proper place in place of the m mentioned 
above. 
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Another way would "be to store the number of temporary 
storage registers used "by a subroutine as one of the preset parameters 
on the subroutine tape. The conversion program could note the largest 
of these* 

On multi-length programs, some decision must "be made con- 
cerning the assignment to single- and multi-length temporary storage. 
One solution is, for all multi-length subroutines to use all temporary 
storage registers multi-length, numbering the single-length temporary 
registers d, dax2. It, ltax2, etc., successively, so as to save storage 
space* 

The use of the d temporary register, with its exceptional 
character, should perhaps be reconsidered, since it prevents use of 
an Ot multi-length temporary register, and is therefore often waste- 
ful of storage* 



Special Aids 

Because this method of assembly makes use of a completely 
different nomenclature for registers outside of the machine, it is 
important that satisfactory retranslation schemes be made available* 
At present there is no post mortem program that can yield a suitable 
retranslation* However, it is apparent that with certain provisos 
a suitable retranslation post mortem might be easily written. 

It might thus be useful to reclassify the preset parameters 
into further subdivisions, for example, "single-length constants," 
"subroutines," "multi-length constants," "main program." Then addresses 
which occurred within a certain block (between two particular preset 
parameters) could be reprinted exactly as they were put in. An alter- 
native method is to include the form of each word with its storage, 
for example. 

For the moment, two tapes entitled "Print Out Preset Para- 
meters," T-1132 and T-1133, are available in 5-5-6 form. After read- 
in of the program, with the direct conversion program, the tape is 
read in over the conversion program and prints out the preset para- 
meters in a standard form. The space is then available for the 
multi-length interpretive and print program, if necessary. Tape 
T-1032 is read in over registers 32 - 200, Tape T-1033 over registers 
734-on (the conversion program). 

Conclusions 

At present, single-length automatic assembly is possible, 
with, however, no checks on overflow of storage. Multi-length com- 
pletely automatic storage awaits the new conversion program and further 
additional control combinations, as well as decisions on temporary 
storage assignments* 
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All such automatic assembly methods depend on two passes 
of the paper tape, or judicious placing of components. Pinal auto- 
matic assembly in the most efficient fashion must await a "two-pass" 
magnetic tape or magnetic drum scheme* 
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